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Abstract 

Submodular function maximization has been studied extensively in recent years under various 
constraints and models. The problem plays a major role in various disciplines. We study a 
natural online variant of this problem in which elements arrive one-by-one and the algorithm 
has to maintain a solution obeying certain constraints at all times. Upon arrival of an element, 
the algorithm has to decide whether to accept the element into its solution and may preempt 
previously chosen elements. The goal is to maximize a submodular function over the set of 
elements in the solution. 

We study two special cases of this general problem and derive upper and lower bounds on 
the competitive ratio. Specifically, we design a 1/e-competitive algorithm for the unconstrained 
case in which the algorithm may hold any subset of the elements, and constant competitive ratio 
algorithms for the case where the algorithm may hold at most k elements in its solution. 
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1 Introduction 


Submodular function maximization has been studied extensively in recent years under various con¬ 
straints and models. Submodular functions, which capture well the intuitive idea of diminishing 
returns, play a major role in various disciplines, including combinatorics, optimization, economics, 
information theory, operations research, algorithmic game theory and machine learning. Sub¬ 
modular maximization captures well known combinatorial optimization problems such as: Max- 
Cut [32l ESI ESI 0OJ [53], Max-DiCut [20] [32] [53] . Generalized Assignment [HJ [121 EH ES], and 
Max-Facility-Location la nil nsi- Additionally, one can find submodular maximization problems 
in many other settings. In machine learning, maximization of submodular functions has been 
used for document summarization @51 [47], sensor placement mmm, and information gather¬ 
ing [25] • In algorithmic game theory, calculating market expansion m and computing core values 
of certain types of games [52] are two examples where the problem can be reduced to submodular 
maximization. 

It is natural to consider online variants of submodular maximization, such as the following set¬ 
ting. There is an unknown ground set of elements A f = {u\,U 2 , • • •, u n }, a non-negative submodular 
function / : 2^ —>• M + and perhaps a constraint determining the feasible sets of elements that may 
be chosen. The elements of J\f arrive one by one in an online fashion. Upon arrival, the online 
algorithm must decide whether to accept each revealed element into its solution and this decision 
is irrevocable. As in the offline case, the algorithm has access to the function / via a value oracle, 
but in the online case it may only query subsets of elements that have already been revealed. For 
a concrete example, consider a soccer team manager preparing his team for the 2014 World Cup 
and hiring k (= 11) players. Occasionally, the manager gets an additional application from a new 
potential player, and may recruit him to the team. The goal is to maximize the quality of the team. 
The quality of the team is indeed a (non-monotone) submodular function of the chosen players. 

Although natural, it is not difficult to see that no algorithm has a constant competitive ratio 
for the above naive model even with a simple cardinality constraint. Therefore, in order to obtain 
meaningful results, one must relax the model. One possible relaxation is to consider a random 
arrival model in which elements arrive at a random order mmm- This relaxation leads to 
“secretary type” algorithms. We propose here a different approach that still allows for an adversarial 
arrival. Here, we allow the algorithm to reject (preempt) previously accepted elements. Preemption 
appears as part of many online models (see, e.g., BOH USED]). The use of preemption in our 
model is mathematically elegant and fits well with natural scenarios. For example, it is natural 
to assume that a team manager may release one of the players (while hiring another instead) if it 
benefits the quality of the team. 

1.1 Our Results 

We study online submodular maximization problems with two simple constraints on the feasible 
sets. The unconstrained case in which no restrictions are inflicted on the sets of elements that the 
algorithm may choose, and the cardinality constraint in which the online algorithm may hold at 
any time at most k elements. We provide positive results as well as hardness results. Our hardness 
results apply both to polynomial and non-polynomial time algorithms, but all our algorithms are 
polynomial (in some of them, a polynomial time implementation losses an e in the competitive 
ratio). 

The unconstrained case: The first special case we consider is, arguably, the most basic case in 
which there is no constraint on the sets of elements that may be chosen. In this case the problem is 
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trivial if the submodular function / is monotonfQ as the algorithm may take all elements. Therefore, 
we only consider non-monotone functions /. One simple candidate algorithm is an algorithm that 
selects each revealed element independently with probability 1/2. Feige et al. [23] proved that this 
algorithm, which does not use preemption, is 1/4-competitive. We prove that this is essentially 
optimal without using the additional power of preemption. 

Theorem 1.1. An online algorithm using no preemption cannot be (1/ 4 + e)- competitive for any 
constant e > 0, even if f is guaranteed to be a cut function of a weighted directed graph. 

On the positive side we prove that preemption is beneficial for the unconstrained-model. 

Theorem 1.2. There exists a 1/ e-competitive algorithm for the unconstrained-model, which can 
be implemented in polynomial time at the cost of an e loss in the competitive ratio (for an arbitrary 
small constant e > 0). 

A special case of the unconstrained-model that we study is the dicut-model. In this model 
every set is feasible and the objective function / is a cut function of a weighted directed graph 
G = (V, A) having AlCl (i.e., a subset of the nodes of G form the ground set). We assume an 
algorithm for the dicut-model knows V (but not Af) and can query the weight (and existence) of 
every arc leaving a revealed node. Observe that the algorithm can in fact calculate the value of / 
for every set of revealed elements using this information and never needs to query / via the oracle. 

The dicut-model can be viewed as an online model of the well-known Max-DiCut problem 
(see Section P for a discussion of another online model of Max-DiCut). Additionally, since the 
dicut-model is a special case of the unconstrained-model (with more power for the algorithm), 
it inherits the positive result given by Theorem 11.21 The next theorem gives a stronger result. 

Theorem 1.3. There exists a polynomial time 0A38-competitive algorithm for the dicut-model. 

Theorem 1 1.31 is proved by showing that an offline algorithm suggested by m can be implemented 
under the dicut-model. We complement Theorems 11.21 and 11.31 bv the following theorem which 
gives hardness results for the dicut-model (and thus, also for the unconstrained-model). 

Theorem 1.4. No randomized (deterministic) algorithm for the dicut-model has a competitive 
ratio better than 4/5 ( 5 ~^ ~ 0.438/J! 

Notice that for polynomial time algorithms a hardness result of 1/2 proved by [23] for offline 
algorithms extends immediately to the unconstrained-model. For the dicut-model there exists a 
polynomial time 0.874-approximation offline algorithm [15], thus, polynomial time offline algorithms 
are strictly stronger than online algorithms in this model. 

Cardinality constraint: The second case we consider is the cardinality-model in which a set 
is feasible if and only if its size is at most k, for some parameter 0 < k < n. Our positive results 
for the cardinality-model are summarized by the following theorem. 

Theorem 1.5. There exists a randomized (56/627 ~ 0.0893/ -competitive algorithm for the cardi¬ 
nality-model, which can be implemented in polynomial time at the cost of an e loss in the compet¬ 
itive ratio (for an arbitrary small constant £ > 0). Moreover, if the objective function is restricted 
to be monotone, then there exists a polynomial time deterministic (1 / 4)-approximation algorithm 
for this model. 

1 A set function / : 2 ^ —> R is monotone if A C B C N implies f{A) < f(B). 

2 Theorem [PI holds even if we allow the algorithm access to all arcs of G, including arcs leaving unrevealed 
elements. 


2 





Monotone Objective 

General Objective 

Deterministic 

Randomized 

Deterministic 

Randomized 

Algorithm 

1/4 mm 

1/4 [HEU 

- 

^ « 0.0893 El 

Hardness 

1/2 + e OD 

3/4 + e (11.61) 

5 -^ « 0.438 dH4j) 

1/2+ £ El 

Polynomial 
time hardness 

1/2 + e GU) 

1 — 1/e [5D] 

5 "^ w 0.438 (El 

0.491 EH 


Table 1: Summary of the results for the cardinality-model. To the right of each result appears 
the number of the theorem (or references) proving it. 


The second part of Theorem 11.51 follows from the works of Ashwinkumar [54] and Chakrabarti 
and Kale Ena • More specifically, these works present a streaming 1/4-competitive algorithm for 
maximizing a monotone submodular function subject to a cardinality constraint]! and this al¬ 
gorithm can be implemented also under the cardinality-model. We describe a different 1/4- 
competitive algorithm for monotone objectives under the cardinality-model, and then use addi¬ 
tional ideas to prove the first part of Theorem 11,51 

It is interesting to note that both algorithms guaranteed by Theorem 11.51 can be implemented 
in the streaming model of Chakrabarti and Kale m- Thus, we give also the first algorithm for 
maximizing a general non-negative submodular function subject to a cardinality constraint under 
this streaming model. 

On the negative side, notice that the cardinality-model generalizes the unconstrained-model 
(by setting k = n). Hence, both hardness results given by Theorem 11.41 extend to the cardinal¬ 
ity-model. The following theorem gives a few additional hardness results for this model. 

Theorem 1.6. No algorithm for the cardinality-model is (l /2 + e)-competitive for any constant 
e > 0. Moreover, even if the objective function is restricted to be monotone, no randomized (deter¬ 
ministic) algorithm for the cardinality-model is (3/4 + e)-competitive /(1/2 + e)-competitive). 

Notice that polynomial time hardness results of 0.491 and 1 — 1/e for the cardinality-model 
and the cardinality-model with a monotone objective follow from m and [50], respectively. All 
the results for the cardinality-model are summarized in Tabled] 

1.2 Related Work 

The literature on submodular maximization problems is very large, and therefore, we mention below 
only a few of the most relevant works. The classical result of ISU states that the simple discrete 
greedy algorithm is a (1 — l/e)-approximation algorithm for maximizing a monotone submodular 
function subject to a cardinality constraint. This result is known to be tight [50j, even in the case 
where the objective function is a coverage function [19]. However, when one considers submodular 
objectives which are not monotone, less is known. An approximation of 0.309 was given by [56]. 
which was later improved to 0.325 [3TJ using a simulated annealing technique. Extending the 
continuous greedy algorithm of [9] to general non-negative submodular objectives, [28] obtained 
an improved approximation of 1/e — o(l). Finally, [25] gave a fast 1/e-approximation algorithm 
and a much slower (1/e + 0.004)-approximation algorithm, demonstrating that 1/e is not the right 
approximation ratio for the problem. On the hardness side, it is known that no polynomial time 
algorithm can have an approximation ratio better than 0.491 [31] . 

3 In fact the algorithm of 22 an d HQ] is (4m) _1 -competitive for the more general constraint of m-matroids 
intersection. 
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For the problem of maximizing a non-negative submodular function subject to no constraints, 
the first approximation algorithm was presented by Feige et al. [23j who gave a 2/5-approximation. 
This was improved in m and [27] to 0.41 and 0.42, respectively. Finally, [8] described a 1/2- 
approximation linear time algorithm, matching the hardness result given by [23] . Huang and 
Borodin m study an online model of this problem where the algorithm can access both / and /. 
This model is powerful enough to implement the randomized 1/2-approximation algorithm of [5], 
and therefore, m consider only deterministic algorithms. Azar et al. [3] consider the Submodular 
Max-SAT problem, and provide a 2/3-approximation algorithm for it that can be implemented 
also in a natural online model. We are not aware of any additional works on online models of 
submodular maximization. 

For the problem of Max-DiCut, Goemans and Williamson [32] obtained 0.796-approximation 
using semi-definite programming. This was improved through a series of works [201B51I48] to 0.874. 
On the hardness side, a (12/13 + ^-approximation algorithm will imply P = NP [36]. Assuming the 
Unique Games Conjecture, the best possible approximation for Max-Cut is 0.878 [US [39], and this 
hardness result holds also for Max-DiCut since the last generalizes Max-Cut. 

Bar-Noy and Larnpis [6] gave a (l/3)-competitive deterministic algorithm for an online model of 
Max-Cut where every revealed node is accompanied by its input and output degrees. For the case 
of a directed acyclic graph, they provide an improved deterministic algorithm with a competitive 
ratio of 2/3 1 " 5 ~ 0.385, which is optimal against an adaptive adversary. Huang and Borodin [37] 
notice that the (l/3)-competitive deterministic algorithm of [6] is in fact identical to the (1/3)- 
approximation deterministic algorithm of [8] for unconstrained submodular maximization. Using 
the same ideas, it is not difficult to show that the (l/2)-approximation randomized algorithm 
of [8] implies a (l/2)-competitive algorithm in this online model. Finally, Feige and Jozeph [21] 
consider oblivious algorithms for Max-DiCut—algorithms in which every node is selected into the 
cut independently with a probability depending solely on its input and output degrees. They 
show a 0.483-approximation oblivious algorithm, and prove that no oblivious algorithm has an 
approximation ratio of 0.4899. 

Finally, the vast literature on buyback problems considers problems that are similar to our 
model, but assume a linear objective function. Many of these problems are non-trivial only when 
preemption has a cost, which is usually assumed to be either constant or linear in the value of 
the preempted element. The work from this literature most closely related to ours is the work of 
Babaioff et al. [im who considered a matroid constraint. For other buyback results see, e.g., da 

saisa isa ns]. 

Paper Organization. Section [2] defines additional notation. Section [3] gives our results for the 
unconstrained-model and the dicut-model, except for Theorem 11.11 whose proof is deferred to 
Appendix [A] Finally, Section [4] describes our results for the cardinality-model. 

2 Preliminaries 

We study the following online variant of the submodular maximization problem. There is an 
unknown ground set of elements A f = {u\, U 2 , ■ ■ ■, u n }, a non-negative submodular function / : 
2^ —> M + and and a down-monotonc0 collection of feasible sets X C 2^. The objective of the 
instance is to find a feasible set maximizing /. The elements of J\f are revealed one by one. The 
algorithm creates n + 1 feasible solutions: Sq, S\,..., S n (each S) € X). The solution So is the 
empty set 0. For every 1 < i < n, Si is the solution selected by the algorithm immediately after 

4 A collection X of sets is down-monotone if AC B C J\f and B £ X imply A £ X. 


4 



element Ui is revealed and the algorithm can choose it to be any feasible subset of Si -1 + njf|. It 
is important to note that the algorithm does not know n (the size of the ground set) in advance, 
hence, the input might end after every element from the algorithm’s point of view. 

3 The unconstrained-model and the dicut-model 

Our positive results for the unconstrained-model and the dicut-model (i.e., Theorems 11.21 
and 11.31) are proved in Section 13. II The negative result for the dicut-model (Theorem 11.41) . which 
applies also to the unconstrained-model, is proved in Section T3.21 

3.1 Algorithms for the unconstrained-model and the dicut-model 

Before describing our algorithm for the unconstrained-model, we need some notation. For two 
vectors x,y E [0,1]^, we use x V y and x A y to denote the coordinate-wise maximum and mini¬ 
mum, respectively, of x and y (formally, (x V y) u = ma x{x u , y u } and (x A y) u = min {x u ,y u }). We 
abuse notation both in the description of the algorithm and in its analysis, and unify a set with its 
characteristic vector and an element with the singleton set containing it. The multilinear extension 
of a set function / : 2^ —> R + is a function F : [0,1]-^" —> M + defined by F(x) = E[/(R(x))], where 
R(x) is a random set containing every element u E M with probability x u , independently. The mul¬ 
tilinear extension is an important tool used in many previous works on submodular maximization 
(see, e.g., B US [56] 1. We denote by d u F(x) the derivative of F at point x with respect to the 
coordinate corresponding to u. It can be checked that F is a multilinear function, and thus: 

d u F{x) = F(x V u) — F(x A (A f — it)) . 

Consider Algorithm [U Recall that Si is the solution that the algorithm produces after seeing 
element itj. 


Algorithm 1: Marginal Choice 


1 

2 

3 

4 


foreach element Ui revealed do 

Choose a uniformly random threshold 0i E [0,1]. 
Let Mi E- {iti,it2, • • 

_ Let Si ■<— {uj € Mi | d Uj F(0j ■ Mi) > 0}. 


Our first objective is to show that Algorithm |T] is an online algorithm according to the unconst¬ 
rained-model. 

Lemma 3.1. For every 1 < i < n, Si C Si-\ + iij. 

Proof. By definition, S t contains only elements of A/). Fix an element Uj € Mi — u t . Then: 

Uj € Si =>■ d Uj F(9j ■ Mi) > 0 =>- d Uj F(9j ■ A/)-i) > 0 Uj £ Si -1 , 

where the second derivation follows from submodularity. □ 

Next, we bound the competitive ratio of Algorithm!!] Fix an element Ui E M. By submodularity, 
g(z) = d Ui F(z ■ M) is a continuous non-increasing function of z. Hence, by the intermediate value 

5 Given a set S and an element u, we use S + u and S — u as shorthands for S U {it} and S \ {it}, respectively. 
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theorem, one of the following must hold: g(z) is always positive in the range [0,1], g(z ) is always 
negative in the range [0,1] or g(z) has at least one root zq E [0,1]. In the last case, the set /o C [0,1] 
of the roots of g(z) is non-empty. Moreover, by the monotonicity and continuity of g(z), /o is a 
closed range. Using these observations, we define a vector y E [0, l]^ as follows: 



^ max Iq 


if d Ui F(0 ■ Af) < 0 , 
if d Ui F(l ■ Af) > 0 , 
otherwise . 


Remark: Notice that the case d Ui F (0 • Af) < 0 in the definition of y can happen only when 

f( 0 )> o. 

Observation 3.2. Every element Ui E Af belongs to S n with probability y Ui , independently. Hence, 
E[f(S n )]=F(y). 

Proof. An element m E Af belongs to S n if and only if d Ui F(9i ■ Af) > 0, which is equivalent to the 
condition < y Ui . Clearly, the last condition happens with probability y Ui , and is independent for 
different elements. □ 


The last observation implies that analyzing Algorithm [T] is equivalent to lower bounding F(y). 
Lemma 3.3. For every A E [0,1], F(y A (A • Af)) > F( A • Af). 

Proof. Observe that: 

F(y A (A W)) = /(0) + f = f(0) + [ Y, B u F(y A (z ■ N))dz , 

•'» iz Jo Fit 

z<Vu 

where the second equality is due to the chain rule. By submoduliry and the observation that 
d u F(z ■ Af) is non-positive for every z > y u , we get: 


Y, d - F (y a (z • Af)) > y d - F ( z • -ao > Y duF ( z • ^ • 

u£j\f u£j\f u£j\f 

z<y u z<y u 


Combining the above equality and inequality, and using the chain rule again, gives: 


F(yA(X-Af))>f(0)+ [ 

Jo 


Y d uF{z- Af) 
0 UeN 


dz = 


m + [ 

Jo 


dF{z ■ Af) 
dz 


dz = F(X- Af) . □ 


We also need a lemma proved by [28J. 

Lemma 3.4 (Lemma III. 5 of [28]). Consider a vector x E [0,1 ]-^”. Assuming x u < a for every 
u e Af, then for every set S C J\f, F(x V S) > (1 — a)f(S). 

The following corollary follows from the last two lemmata. 

Corollary 3.5. F(y) > e -1 • F(OPT), where OPT is the subset of Af maximizing f(OPT). 


6 







Proof. Fix two values 0 < a < b < 1. By the chain rule: 

rb dF(y A (z ■ Af)) 


F(j/A(6-AT))-F(j/A 


(a-Af)) = [ 

J a 


dz 


dz 


(1) 


^2 d u F(y A (z ■ Af))dz > / ^ d u F(z ■ Af)dz , 


u&M 

z<y-u. 


u&M 

z<y u 


where the inequality follows from submodularity. We use two ways to lower bound the rightmost 
hand side of the above inequality. First, observe that d u F(z ■ Af) > 0 whenever z < y u . Thus, 
F(y A (z - Af)) is a non-decreasing function of z. Additionally, d u F(z - Af) < 0 whenever z > y u and 
d u F{z ■ Af) > 0 whenever z < y u . This allows us to derive the second lower bound, which holds 
whenever z is not equal to any coordinate of y (i.e., for every value of z except for a finite set of 
values): 


Y, d u F(z-Af) > J2 d uF(z-M) 


Y, U £OPt[ f ( u v ( z ■ ■A/')) - F ( z ■ -A/")] 


ueM 

z<y u 


> 


u£ OPT 

F{OPT V (z ■ Af)) — F(z ■ Af) 
1 - z 


1 - z 


> f(OPT) - 


F{y A (z • Af)) 


1 - z 


where the last inequality follows from Lemmata 13.31 and 13.41 Plugging this lower bound and a = 0 
into © gives: 


F(yA(b-AT))>f(0) + 


i o L 


f{OPT) 


The solution of this differential equation is: 


F(yA(z-AT)) 
1 - z 


dz . 


F(y A (b ■ Af)) > -(1 - b) ■ ln(l - b) ■ f(OPT ) . 

The rightmost hand side of the last inequality is maximized for b = 1 — e _1 . Recalling that 
F(y A (z • Af)) is a non-decreasing function of z, we get: 

F(y) = F(y A (1 • Af)) > F(y A ((1 - e" 1 ) • Af)) > e" 1 • f(OPT) . □ 

The last corollary completes the proof of the first part of Theorem 11.21 A naive implementation 
of Algorithm [T| requires an exponential time to evaluate F. However, for every constant c > 0 it 
is possible to approximate, in polynomial time, d Uj F(8j ■ Aff) with probability 1 — csF 2 up to an 
additive error of csF 2 ■ f(OPT) using sampling (see [Sj for an example of a similar estimate) Jf] 
For a small enough c, such an approximation is good enough to affect the competitive ratio of the 
algorithm by only an e. To keep the algorithm within the unconstrained-model, the samples used 
to approximate d Uj F(9j - Aff) for different values of i need to be correlated to guarantee that the 
approximation is a decreasing function of i. For more details, see Appendix [Bj 

The rest of this section considers the dicut-model and is devoted for proving Theorem 11,31 
Recall that in this model / is the cut function of some weighted directed graph G = (V,A). Let 
Wi n (u) (w ou t(u)) denote the total weight of the arcs entering (leaving) element u. Throughout this 
section we assume there are no arcs of G leaving nodes of V \ Af. Removing such arcs does not 
affect the value of any solution, and thus, our results hold also without the assumption. Feige and 
Jozeph [22] proved the following theorem (we rephrased it in our notation). 

6 It is not possible to decrease the error to something that depends on n because n is unknown to the algorithm 
when it calculates these estimations. 
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Theorem 3.6. There exists a non-decreasing function h : [0, 1] —>• [0, 1] such that the vector 
y € [0,1]-^ defined by: 

w OVLt (u) 

w in (u) + w out (u) 

obeys F(y ) > 0.483 • f(OPT). Moreover, h is independent of G and can be computed in constant 
time (assuming numbers comparison takes constant time). 

Notice that in the last theorem y u is undefined when w- m (u) = w OVLt (u) = 0. The theorem holds 
regardless of the value used for y u in this case because such an element u is isolated. We can now 
present our algorithm for the di cut-model which is depicted as Algorithm [2j 



Algorithm 2: Degrees Ratio Choice 


l foreach element m revealed do 


2 

3 

4 

5 


Choose a uniformly random threshold € [0,1]. 


Initialize Si 


0 . 


for j = 1 to i do 

Let i) denote the total weight of arcs from revealed elements to Uj. 


if Wi n (uj,i) + w ou t(uj) > 0 then Let y Uj (i ) 
y Uj (i) -s- 1. 

if y Uj (i) > 0j then Add Uj to S'*. 


h 


Worn (Uj ) 

Win(wj ,i)+^out ffij ) 


else Let 


Observation 3.7. The competitive ratio of Algorithm^ is at least 0.483. 

Proof. Notice that the vector y(n) defined by the algorithm is equal to the vector y defined by 
Theorem 13.61 (wherever the last is defined). Notice also that every element u E fif belongs to S n 
with probability y u (n), independently. Hence, E[/(S' n )] = F(y). The observation now follows from 
Theorem 13.61 □ 

To complete the proof of Theorem 11.31 we only need the following lemma which shows that 
Algorithm [2] is an online algorithm of the dicut-model. 

Lemma 3.8. For every 1 < i < n, Si C S t -\ + u t . 

Proof. For i = 1 there is nothing to prove since Si C {ui}. Thus, we assume from now on i > 2. 
By definition, Si contains only elements of A/). Fix an element Uj € A/) — u%. We have to show that 
Uj € Si- 1- First, let us prove that: 


y Uj (i) < y Uj {i ~ 1 ) • ( 2 ) 

There are two cases to consider. If w Ui (uj,i — 1 ) + w out (uj) < 0 then y Uj {i — 1) = 1, which 
proves m since y Uj (i) € [0,1]. Otherwise, since w in (uj,i) is a non-decreasing function of i we get 
W{ n {uj,i) + w ou t(uj) > 0. Inequality ([2]) now follows since h is also non-decreasing. Using © we 
now get: 


Uj € Si y Uj (i) P 0j =7^ y Uj (i 1) P 9j Uj G Si—\ 


□ 










Figure 1: Graph for Lemma 13.91 The number beside each arc represents its weight. 

3.2 Hardness results for the dicut-model 

In this section we prove Theorem 11.41 The proof of the theorem is split between two lemmata. 
Lemma 13.91 proves the part of Theorem 11.41 referring to deterministic algorithms, and Lemma 13.101 
proves the part referring to randomized algorithms. Both lemmata present an absolute graph 
G = ( V,A ), and then fix an algorithm ALG and describe an adversary that reveals some of the 
nodes of V. The choice which nodes to reveal is done iteratively, i.e., the adversary initially reveals 
some nodes and then can reveal additional nodes based on the decisions of ALG (or the probabilities 
of various decisions in case ALG is randomized). Formally, in the hard instance for ALG , the set 
J\f is the set of nodes revealed by the adversary, and the hard instance reveals these nodes in the 
same order they are revealed by the adversary. 

Lemma 3.9. No deterministic online algorithm has a competitive ratio better than (5 — y/YI)/2 
for the dicut-model. 

Proof. Consider the directed graph G describe in Figured! and fix a deterministic algorithm ALG 
for the dicut-model. We describe an adversary that reveals some of the nodes of G and forces 
ALG to be no better than ((5 — \/l7)/2)-competitive. For every pair of nodes x and y of G, we 
denote by c(xy) the weight of the arc from x to y. Our adversary begins by revealing u\ and U 2 
and stops if ALGA solution at this point is either empty or equal to {ui,u 2 }. If ALGA solution is 
empty then ALG is not competitive at all. On the other hand, if ALGA solution is { 111 , 112 } then 
its solution has a value of: 


c(uivi) + c(u 2 u 2 ) = 2 • 


y/T? — 3 
4 


Vrr -3 

2 


On the other hand, the optimal solution is { 14 }, whose value is: 


c(u\vi) + c(ui« 2 ) 


V17 -3 vTr + i 

4 4 


Thus, in this case, the competitive ratio of ALG is at most: 


VTf — 3 _ Wf+1 _ 2/17-6 _ 5-/7 
2 ' 4 y/Yf+1 ~ 2 

By the above discussion, the only interesting case is when ALGA solution contains exactly one 
of the elements u\ or u 2 . Notice that G is symmetric in the sense that switching every node having 
the index 1 with the corresponding node having the weight 2 does not change the graph. Hence, it 
is safe to assume ALGA solution is exactly {«i}. The next step of the adversary is to reveal v\. If 
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Figure 2: Graph for Lemma 13.101 All arcs have an identical weight assumed to be 1. 


v\ enters the solution of ALG , then, regardless of weather u\ is kept in the solution, the value of 
ALG 's solution is c(u\U2 ) = c{y\U\) = 1. On the other hand, the optimal solution at this point is 
{u 2 ,v{\ whose value is: 


C{U2U\) + c(u 2 V2) + c(y\ui) 


1 + 


y/Vf-3 

4 


+ 1 


Hence, the competitive ratio of ALG is at most: 


\/l7 + 5 
4 


4 _ 5-y/Vt 

VT7 + 5 ~~ 2 

We are left to handle the case in which v± does not enter ALG 's solution. In this case, the 
adversary reveals also w\. ALG 's solution at this point must be a subset of and every 

such subset has a value of at most: 


c(uivi) + c(u\U2) = c(w±ui) 


y/Vf+1 

4 


On the other hand, the optimal solution at this point is {u 2 ,vi,w\} whose value is: 


c(U2Ul) + c(u 2 V2) + c(v\Ul) + c(wiUl) = 1 + 
Hence, the competitive ratio of ALG is at most: 

y/V7+l _ y/Vf + 3 _ y / l7+ 1 
4 ' 2 _ 2\/l7 + 6 


V17-3 1 Vtf+1 

- 1 -+ 1 + - 1 - 

4 4 

7-V17 5 — VT7 

=---<-^- 


Vl7 + 3 
2 


□ 


Lemma 3.10. No randomized online algorithm has a competitive ratio better than 4/5 for the 

dicut-model. 

Proof. Consider the directed graph G describe in Figure O and fix an algorithm ALG for the 
dicut-model. We describe an adversary that reveals some of the nodes of G and forces ALG to 
be no better than 4/5-competitive. The adversary begins by revealing the nodes u\ and 112 of the 
graph. At this point, let p\ be the probability that u\ is alone in the solution of ALG and P 2 be 
the probability that 112 is alone in this solution. If pi + P 2 < 4/5 then the adversary stops here, 
and the competitive ratio of ALG is no better than 4/5 since the optimal solution at this point has 
a value of 1 and the expected value of the ALG' s solution is only p\ + P 2 - Thus, we may assume 
from now on p\ + P 2 > 4/5. 

By symmetry, we may also assume, without loss of generality, P 2 < p\ , which implies: 2p\ > 
P 1 +P 2 > 4/5. The next step of the adversary is revealing w\. The optimal solution at this point is 
{« 2 ,ud}, whose value is 2. Let analyze the best solution ALG can produce. With probability pi, 
ALG must pick a subset of and thus, cannot end up with a solution of value better than 

1. With the remaining probability, ALG produces a solution no better than the optimal solution, 
i.e., its value is at most 2. Hence, the expected value of ALG' s solution is at most: 

pi + 2(1 - pi) = 2 - pi < 2 - 2/5 = 2 ■ (4/5) . □ 
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4 The cardinality-model 

Our positive and negative results for the cardinality-model (i.e., Theorems 11.51 and 11.61 respec¬ 
tively) are proved in Sections 14.11 and 14.21 respectively. 

4.1 Algorithms for the cardinality-model 

We first consider the special case of the cardinality-model where the objective function / is 
monotone (in addition to being non-negative and submodular). For this case it is possible to derive 
a 1/4-competitive algorithm from the works of Ashwinkumar [54] and Chakrabarti and Kale [10] on 
streaming algorithms. We describe an alternative algorithm which achieves the same competitive 
ratio and has an easier analysis. A variant of this algorithm is applied later in this section to the 
general cardinality-model. 

Our alternative algorithm has a parameter c > 0 and is given as Algorithm [3j Intuitively the 
algorithm gets the first k revealed elements unconditionally. For every element it, arriving after the 
first k elements, the algorithm finds the best swap replacing an element of the current solution with 
Ui. If this swap increases the value of the current solution by enough to pass a given threshold, 
then it is carried out and Ui enters the solution. 


Algorithm 3: Greedy with Theshold(c) 


1 Let So <r- 0. 

2 foreach element m revealed do 


3 

4 

5 

6 

7 

8 

9 

10 


if i < k then 

|_ Let Si <- Si-i + Ui. 

else 

Let it' be the element of Si -1 maximizing /(Si_ 1 + Ui — vf). 
if f(Si -1 + Ui- it') - f(Si- 1) > c • f(Si-i)/k then 
|_ Let Si <- Si-i +Ui~ it', 
else 

|_ Let Si Si-i. 


Let us define some notation. For every 0 < i < n, let A t = U* =1 5j. Informally, A % is the set of 
all elements of {iti,it2, ■ ■ ■ ,Ui} originally accepted by Algorithm (3[ regardless of whether they are 
preempted at a later stage or not. We also use f(u \ S ) to denote the marginal contribution of an 
element u to a set S. Formally, /(it | S) = f(S + it) — f(S). The following technical lemma is used 
later in the proof. 


Lemma 4.1. For every k <i <n, /(Si-i + Ui - it') - f(Si-i) > f{m \ A_i) - f(Si-i)/k. 
Proof. By the choice of it' and an averaging argument, 

if ( s i-i + u i~ u> ) - /OVi)] 


f(Si-l + Ui — u'i) — f(Si-l) > 


k 


> Hu'eSi-AfiSj-i+Uj) ~ f(Si-i)} Zu’eSj-Afi Si -1 - «') - /(^-1)] 


k 


k 


where the second inequality follows from submodularity. The first term on the rightmost hand side 
is f(ui | Si-i), which can be lower bounded by /(it* | A_i) because S t -\ C j. The second term 
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on the rightmost hand side can be lower bounded using the submodularity and non-negativity of 
/ as follows: 


- u ') - /( 5 -i)l ^ /(0) - /(5i_i) ^ /(5i_i) 

k k - k 

The following lemma relates Si and Aj. 

Lemma 4.2. For every 0 < i < n, /(Si) > • f(Ai). 

Proof. For i = 0, the lemma clearly holds since f(Ao) = /(0) = /(So). Thus, it is enough to prove 
that for every 1 < i < n, f(Si ) — f(Si- 1 ) > yfy • [f(Ai) — f(Ai- 1 )]. If the algorithm does not accept 
Ui then Si = Si -1 and Ai = Ai- i, and the claim is trivial. Hence, we only need to consider the 
case in which the algorithm accepts element m. 

If i < k then /(Si) - /(S*_i) = f(ui \ Si- 1 ) > f(ui \ Ai- 1 ) = f{Af) - /(A_i), where 
the inequality follows from submodularity since Si-\ C Ai-\. If i > k, then it is possible to 
lower bound f(Si) — f(Si- 1 ) = f(Si- 1 + Ui — u'f) — /(S*_i) in two ways. First, a lower bound of 
f(ui | Ai- 1 ) — f(Si-i)/k is given by Lemma f4.ll A second lower bound of c • f{Si-\)/k follows 
since the algorithm accepts Ui. Using both lower bounds, we get: 

f(Si) - f(Si- 1) > max{/(uj | Ai- 1) - f(S i - 1 )/k,c■ f(S i - 1 )/k } 

> c • [f(uj | Ai- y) - /(Sj-i)/fc] + 1 • [c • /(Sj-i)/fc] 

_ c + 1 

> -4t • /(«* | A,_!) = • [/(A*) - /(Ai_i)] . □ 

c + 1 c + 1 

We are now ready to prove the competitive ratio of Algorithm 02 and thus, also the second part 
of Theorem 11.51 

Corollary 4.3. The competitive ratio of Algorithm 0 is at least Hence, for c = 1 the 

competitive ratio of Algorithm^ is at least 1/4. 

Proof. Let OPT be the optimal solution, and consider an element Ui € OPT \ A j. Since u ? ; was 
rejected by Algorithm 02 the following inequality must hold: 

c-f(Si-1 ) , , /Cf N ^ ^ | A \ f(Si-i) 

- j: -> f{Si -1 + Ui - Ui) - f(Si- 1) > f{Ui I Ai- 1)--— , 

where the second inequality follows from Lemma 14.11 Rearranging yields: 

f( Ui | Ai-i) < ■ /(5i_!) < ^ • f(S n ) , 

where the last inequality uses the monotonicity of /(Si) (as a function of i). In conclusion, by the 
submodularity and monotonicity of / and Lemma 14.21 

/(OPT) < f(A n ) + J2 f( U I A ") < /(An) + E (T' /(s ">) 

uGOPT\A n uGOPT\A n ' ' 

< /(A„) + (c + 1) ■ /(5 n ) < . /(5 n ) . □ 

c 
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At this point we return the focus to the general cardinality-model allowing the objective / 
to be non-monotone. For the algorithm and analysis we present we need the notation described 
at the beginning of Section [3j More specifically, we use the multilinear extension F of f and the 
assumption that a set S C J\f represents also its characteristic vector. As a first step, consider a 
variant of Algorithm [3] modified in two ways: 

• The value k is replaced with pk for an integer parameter p > 1. 

• The algorithm uses an auxiliary function g : 2^ —)• JR" 1 " instead of the real objective function 
/. The auxiliary function g is defined as g = F(p _1 ■ S ). 

Notice that the modified algorithm does not produce a feasible solution. Still, we are interested 
in analyzing it for reasons that are explained later. The proofs of Lemmata 14. II and 14.21 do not use 
the monotonicity of /. Hence, both lemmata still hold with pk and g taking the roles of k and 
/, respectively (notice that g is non-negative and submodular). The resulting lemmata after the 
replacements are given below. 

Lemma 4.4. For every pk <i <n, g(Si -1 + - u'f) - g(Si- 1) > g(ui | A;_i) - g(Si-i) / (pk). 

Lemma 4.5. For every 0 <i <n, g(Si) > ■ g{Af). 

We also need the following lemma of (23]. 

Lemma 4.6 (Lemma 2.3 of [23j). Let f : 2 ^ —>• R be submodular, A,B C A two (not necessarily 
disjoint) sets and A(p),B(q) their independently sampled subsets, where each element of A appears 
in A(j>) with probability p and each element of B appears in B(q) with probability q. Then 


E [f{A[p) U B(q))\ > (1 - p){l - q) ■ f(0) + p( 1 - q) ■ f(A) + (1 - p)q ■ f(B) + pq ■ f(A U B) . 
Using the above lemmata, it is possible to get a guarantee on g(S n ). 

Corollary 4.7. Let OPT be the optimal solution. The modified Algorithm^ guarantees: 

-nopT). 

(c + 1) (1 +p 1 c) 

Hence, for c = 7/4 and p = 3, g(S n ) > (56/627) • f(OPT). 

Proof. Consider an element u, € OPT \ Aj. Since Ui was rejected by Algorithm [3] the following 
inequality must hold: 

C ' l) r Q f\ / Q \ ^ / I A \ i) 

-;-> g(Si -1 + Ui - uj - g(Si- 1 ) > g[Ui \ A»_i)- - — , 

pk pk 

where the second inequality follows from Lemma 14.41 Rearranging yields: 

g(ui | Aj_i) < ■ g(Si-i) < ■ g(S n ) , 

where the last inequality uses the monotonicity of g(Si) (as a function of i) for i > pk. By the 
submodularity of g and Lemma 14.51 we now get: 

c + 1 


g(A n U OPT ) < g(A n ) + ^ g(m \ A n ) < g{A n ) + ^ 

ueOPT\A n udOPT\A n 

< g(A n ) +p-\c+ 1) • g(S n ) < (C+1)(1 c +P lg) • g(S n 
On the other hand, by Lemma 14.61 


pk 


g(Sn) 


g{A n U OPT ) = F(p~ l ■ OPT + p~ l • (A n \ OPT)) > p~\ 1 - p~^) • f(OPT) 


□ 
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A naive implementation of Algorithm [3] requires an exponential time to evaluate F. However, 
this can be fixed, at a loss of an arbitrary small constant e > 0 in the guarantee of Corollary 14.71 
See the above discussion about a polynomial time implementation of Algorithm [T] for the main 
ideas necessary for such an implementation. For more details, see Appendix [Cj 

Consider a collection of vectors {yi}™ =0 obtained from the sets {*Si}™ =0 via the following rule: 
Hi = p~ l ■ Si. The vectors {yi}^ = 0 form a fractional solution for the cardinality-model in the 
following sense. First, for every 0 < i < n, the sum of the coordinates of yi is at most k. Second, 
for every 1 < i < n, yi < yt-i V u, where the inequality is element-wise. To convert the vectors 
{l/iliLo an i n teg ra l solution {>S{}” =0 for the cardinality-model, we need an online rounding 
method. 

The rounding we suggest works as follows. For every 1 < £ < k, select a uniformly random value 
rg from the range {p{£ — 1) + 1, p(£ — 1) + 2,... , p£}. Intuitively, the values rg specify the indexes of 
the elements from the set {try}y=i that get into the rounded solution. Later elements can get into 
the rounded solution only if they “take the place” of one of these elements. More formally, we say 
that an element Uj € Si uses place a if there exists a series of indexes: j\, j-g ,..., j m such that: 

j = ji , u'j. = u ji _ 1 (i.e., u ji replaced u ji ^ 1 in SyJ and j m = a . 

The element Uj € Si gets into Si if and only if it uses a place a equal to rg for some £ € {1, 2,... , k}. 

Observation 4.8. For every 0 < i < n, at most one element of Si uses every place a € 
{1,2,... ,pk}. 

Proof. The observation follows immediately from the definition when i < pk because for every 
1 < j < i, the element Uj uses place j. For larger values of i we prove the observation by induction. 
Assume the claim holds for i — 1 > pk, and let us prove it for i. Since Ui got into Si, the place used 
by Ui is defined as the place used by u[. As Si = S t -\ + m — u\, every place is used at most once 
also in S'*. □ 

Corollary 4.9. The sets {5j}f =0 form a feasible online solution for the cardinality-model. 

Proof. Observation 14.81 implies immediately that \Si\ < k since only k places make elements of Si 
using them appear in S t . Observe also that the rule whether an element of Si gets into {Si} is 
independent of i. Hence, for every 1 < i < k, Si C S ? ;_i + m simply because Si C Si— i + u*. □ 

Next, we want to show that f(S n ) is as good approximation for f(OPT) as g(S n ). This is done 
by the following lemma. 

Lemma 4.10. E[/(5„)] > g(S n ). 

Proof. For every 1 < £ < k, let Rg be the set of elements of S n using places from {p(£— 1) + l,p{£ — 
1) + 2 ,p£}. By Observation 14.81 the size of Rg is at most p. Let Rg be a set containing the single 
element of Rg using the place rg, or the empty set if no element uses this place. By definition, 
Sn = ULl Rl- 

Recall also that given a vector x € [0,1]-^”, R(x) is a random set containing every element u € Ri 
with probability x u . Using this notation, 


5 ( < S n )=E[/(R(p- i -5 n ))]=E 


/ U R 0p -1 • Rt) 


\t =1 
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Hence, the lemma is equivalent to the inequality: 


E 



> E 


/ ( Q Hp 1 • R f) 


i =i 


(3) 


As a step toward proving Inequality (|3|) . fix 1 < i < k. and let D be a random subset of A f\Re 
with an arbitrary distribution. We use the notation vi,V 2 , ■ ■ ■ ,v p > to denote the elements of Rp 
( p' = \Rg\ < p), and R .gj to denote the set {vi,V 2 , ■ ■ ■, Vi} C Rg. By submodularity, 


E [/ {D e UR e )] =E [f(D e )] + 


> E [f(D e )} + 


zunf(vi\Dt)\ 


p 


Ef=i nf(vi I D e u R(p _1 • R e,i-i)\ 


P 


E [f (Dg U R(p -1 • Rg))] 


Inequality d3|) follows by repeated applications of the above inequality, once for every 1 < i < k. □ 

The first part of Theorem II.51 follows by combining the modified Algorithm [3] with the rounding 
method described above. 


4.2 Hardness results for the cardinality-model 

In this section we prove Theorem 11,61 The proof of the theorem is split between two lemmata. 
Lemma 14.111 proves the part of Theorem 11.61 referring to monotone objectives, and Lemma 14.121 
proves the part referring to general objective functions. 

Lemma 4.11. No deterministic (randomized) online algorithm has a competitive ratio better than 
1/2 + e (3/4 + e) for the cardinality-model, even if the objective function is restricted to be 
monotone. 

Proof. Let k = [(2e)^ 1 ] and consider the ground set A f = {ui}^ =l U{vi}^ =1 U{w} and the monotone 
submodular function / : 2^ —>• defined as follows: 

f(S) = \S n {ui}i = i| + min {k, |S n {wjEil + k ■ \S n {rc}|} . 

Intuitively, / gains a “point” for every element of the forms Uj and V{. The element w gives k 
“points”, but these are the same points of the elements of the form v l , i.e., Vi gives no “points” 
once w is in the solution. 

Fix an algorithm ALG , and set the element w to be revealed after all the other elements of 
J\f. Observe that as long as w is not revealed, there is no way for ALG to distinguish between 
the elements of {u t }/ =l and {ty}Z =|L . Thus, if ALG is deterministic, the set S 2 k is determined 
solely based on the order in which the elements are revealed. By choosing the right order, one can 
guarantee that C {'Ci}/ =1 , which implies that S 2 k+i is a subset of U {re}, and thus, has 

a value of at most k. On the other hand, the optimal solution is {uij^Zl U {w}, and its value is 
2k — 1. Hence, the competitive ratio of ALG is at most: 

k < ttl <-+£ . 

2k - 1 “ 2k ~ 2 

If ALG is randomized, S/fe depends also on the random choices of ALG. However, by symmetry, 
it is still possible to choose a revelation order for the elements of {ui}/ =l and {vi,}/ =i guaranteeing 
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that in expectation S 2k contains no more than k/2 elements of {iq}/ =1 ■ Thus, the expected value 
of f(S 2k+1 ) can be upper bounded by 3k/2, and the competitive ratio of ALG is no more than: 


3fc/2 ^ 3fc/2 + 1 ^ 3 
2k-1 “ 2k “ 4 +e 


□ 


Lemma 4.12. No online algorithm has a competitive ratio better than 1/2 + e for the cardinal¬ 
ity-model. 

Proof. Intuitively, the proof of Lemma 14.121 provides a weaker bound for randomized algorithms 
because the number of elements of the form Vi in the ground set is quite small. In this proof we fix 
this problem. Let k = |~£ _1 ~| and h = 2k- |~£ _1 ~|. Consider the ground set AT = {itj}^ =1 U{uj}^ =1 U{'i/;} 
and the non-negative submodular function / : 2^ -+ M + defined as follows: 


m 


| S'| if w£S , 

k + \S (~l {ui}!? =1 \ otherwise . 


Fix an algorithm ALG , and set the element iv to be revealed after all the other elements of 
Af. Observe that as long as w is not revealed, there is no way for ALG to distinguish between the 
elements of and {uj}/ =1 . Hence, by symmetry, it is possible to choose a revelation order for 

these elements guaranteeing that in expectation S k +h contains no more than k 2 /{k + h) elements of 
{ u i\i=v The final solution of ALG is a subset of S k +h + w. Since the size of S k +h is at most k, no 
element of S k +h + w has a negative marginal contribution, and thus, the value of ALGA solution 
can be upper bounded by f{S k +h + w). Hence, the total expected value ALG achieves is at most: 

E[f(S k+h + «,)] = k + EOSfe+fe n {uij^iW <k + . 


On the other hand, the optimal solution is U {w}, and its value is 2k — 1. Hence, the 

competitive ratio of ALG is at most: 


^ + k+h < k + 1 + 


2k-1 


2k 


k 1 
k + h — 2 ~*~ 


□ 
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A Proof of Theorem 11.11 


In this section we prove Theorem 11.11 Namely, we need to show that no algorithm using no 
preemption is (1/4 + ^-competitive for any constant £ > 0 even when / is guaranteed to be a 
cut function of a weighted directed graph and there is no constraint on the set of elements that 
can be picked. Assume for the sake of contradiction that there exists such an algorithm ALG for 
some, fixed, e > 0. We design an adversary that constructs an instance for ALG that leads to a 
contradiction. The adversary uses a ground set JV with n = n(e) elements (where n(e) is function 
that depends only on e and is described later). 

Each time an element iq is revealed the adversary uses the properties of ALG to choose the 
weighted set Ei of arcs that leaves iq. The objective function / is defined as the weighted cut 
function of the graph (A f, U-/=i A)). Notice that any query of / made by ALG before Ui is revealed 
does not depend on Ei, and thus, the behavior of ALG up to the point when Ui is revealed can be 
described before Ei, E l+ \ ,..., E n are determined. An exact description of the adversary is given 
as Algorithm |4j For ease of notation, we denote by A/) the set {uj \ 1 < j < i} for every 0 < i < n. 


Algorithm 4: Adversary for ALG 

1 Let m i— 1 + 4/e, no i— |" 8 /e] and n = |~e 8 / £ • no]. 

2 Let Mode <— A. 


for i = 1 to n do 
if Mode = A then 

Reveal element Ui with arcs of weight m l going to all nodes revealed so far. 

else // If Mode = B 

Reveal element Ui with no exiting arcs. 


8 
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10 

11 

12 

13 


for every set S C V)_i do 

Let A^g be the event that the algorithm accepted exactly the elements of S before iq 
is revealed. 

Let pi t g be the probability of A,; 5 . 

Let qi t g be the probability Ui is accepted given 


if i > n 0 and 
Mode •(— B. 


Pi,S ■ di,S ■ 



< 1/4 + e/4 then 


Observe that Algorithm [4] operates in two modes. In mode A every revealed element has an arc 
going to every previously revealed element. In mode B the revealed elements have no exiting arcs 
(i.e., their output degrees are 0). The adversary starts in mode A and then switches permanently 
to mode B when some condition holds for the first time. 

Lemma A.l. The adversary given by Algorithm^never switches to mode B. 

Proof. Assume for the sake of contradiction that the adversary switches to mode B immediately 
after element iq is revealed. Let us upper bound the weight of the cut C produced by ALG in this 
case. If ALG accepted a set S C S',_ 1 of elements before Ui is revealed, then the expected number 
of arcs leaving Ui crossing C is {i — 1) ■ • (l — j. By the linearity of the expectation, the 
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expected number of arcs leaving u t crossing C is: 


E 

SCMi-! 


Pi,s ■ <a,s 



<(*-!)• (1/4 +e/4) . 


The total weight of all arcs leaving elements other than tq is: 


i— 1 


i -1 


oo 


— 1) • mP] < (i — 1) • ^ < (i — 1) • ^ m l 1 J 

j=i j =1 3=0 


(i-1) 


m 


m — 1 


(i — 1 )m l ■ 


£ 

4 ' 


Hence, even if all the above arcs cross C, the total weight of C is still less than (i — l)m l -(l/4+e/2). 
On the other hand, the optimal solution may accept m and no other elements, which results in a 
cut of weight (i — 1 )m l . Hence, the competitive ratio of ALG cannot be 1/4 + e. □ 


Let us denote iys = 1 — • Using this notation the previous lemma implies that for every 

no < i < n: 

E ( Pi > s ' %S ' ti,s) > 1/4 + e/2 . 

SCAfi -1 

Consider the potential function 4>(i) = {pi,S ■ tfgj. 

Observation A.2. For every 1 < i < n — 1, 

'[(i-l)U i S + (l-%,s)] 2 2 


$(* + !) - $(*) = I] 

SCA/i-i 


Pi, s 


— t: 


i,S 


E («, 

SCAfi_ i 


s • 


(i - + 2(* - i)tys(i - Qi,s) + (i - qi,sY z 


Proof. Let B{ be the expected set of elements accepted by ALG immediately after Ui is revealed. 
Notice that: 


<J>(i + 1) = E 


1 - 


I BA 


E ( Pi ’ s ’ E 

SCAfi_ i \ 


1 - 


I B; I 


A, 


i,S 


Observe also that E[|i?j| | Ays] = E[|5| + q- lt s \ Ays], Hence, by the linearity of the expectation. 

2 


$(* + !) = E (^,s- E 

SCMi -1 V 


1 - 


|5| + <?ys 


A 


ys 


E 

SCMi -1 \ 


(1 - iys)(* - 1) + gys 


E 

SCjVi-1 


PyS • 


*ys(* - 1) + (1 - Qi,s) 


The observation follows by combining the above equality with the definition of 
Using the above observation, we can now prove the following lemma. 


□ 
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Lemma A. 3. For every no < i < n — 1, 4>(z + 1) — 4>(i) < —e/(4 i). 
Proof. By Observation IA.21 


<4>(i + 1) - $(z) 


E U’S 

SCMi-i \ 


(1 - 2 i)t 2 iS + 2 (i - 1)^(1 - q^s) + (1 


-* ' t'ls + i • *i,S • (! - 9i,s) + l\ 

2 _ 2 

v • E 0 m ■ ti,S ■ (1 - Qi,S - U,S )) + ^2 • 

* SCM-i * 



%,s) 2 


Notice that t i>s ■ (1 - qi,s ~ U,S ) = U,S ■ (1 - U t s) - ti,s ■ %,s < 1/4 — £ ijS • Plugging this inequality 
into the previous one yields: 


$(i + 1)-$(*)<? • V (pi, s ■ (1/4- t i>s ■ qi,s)) + 4 = “ ’ 

l z —' Z z l 

SCAfi-1 


1/4 - E PLS ■ • %,S 

SCA/i_i 


+ 


2 r ; / , . 2 £ 2 £ £ £ 

< - • [1/4 - (1/4 + e/4)] + - = 

where the third inequality holds since i > no > 8/e. 

Corollary A.4. 4>(n) < 4>(no) — 2. 

Proof. By Lemma I A .31 


□ 


71—1 71—1 

4>(n) - 4(n„) = £>(i + 1) - *(i)] <~T.fi 

i—n o Z—TT-o 

c* /'TT. J _ _ 

<--•/ — = -- ■ ln(n/n 0 ) < -- • ln(n 0 • e 8/e /n 0 ) = -2 . □ 

4 Jn 0 x 4 4 

Corollary IA.4I leads to an immediate contradiction since 4> can only take values from the range 

[ 0 , 1 ]- 

B A Polynomial Time Implementation of Algorithm |T] 

In this section we prove the second part of Theorem 11.21 by giving a polynomial time implemen¬ 
tation of Algorithm [T] having the same competitive ratio up to an arbitrary constant e > 0. Our 
implementation is given as Algorithm [5] This algorithms uses the notation Tg(x), where x is a 
vector and 8 £ [0,1]. This notation stands for {u \ x u > 9}, i.e., the set of all elements whose 
coordinate in x is at least 9. 

Our first objective is to show that Algorithm [5] is an online algorithm according to the unconst¬ 
rained-model. 

Lemma B.l. For every 1 < i < n, Si C 5)_i + m. 
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Algorithm 5: Marginal Choice - Polynomial Time Implementation 


1 foreach element Ui revealed do 

2 Choose a uniformly random threshold € [0,1]. 

3 Let Mi 4- {u\,U 2 , ■ ■ ■, Ui}. 


4 

5 

6 
7 


Let Vi be a collections of [3072 • e~ 2 i 4 ln[2e _1 (4i) 2 ]] random vectors from [0,1]-^. 
for j = 1 to i — 1 do 

for every vector v € Vj do 

Extend v to be a vector of [0, l]^"* by setting v Ui to be an independent uniformly 
random value from [0,1]. 


8 

9 

10 


for j = 1 to i do 

|_ m i <- IVjl -1 • Y,v&> j [f( T 0i( v ) + u j) - KToM ~ U J')]- /* m j ~ ®ujF(Qj - Mi). 
Let Si 4— {uj € Ni | rrij > 0}. 


*/ 


Proof. For i = 1 the claim is trivial since Si can contain only u\. Thus, we assume from now on 
* > 1. For every 1 < j < i — 1, let mj and Vj denote the value mj and the set Vj after iteration 
i — 1, and let ml and Vj denote these entities after iteration i. 

By dehnition, Si contains only elements of Af t . Fix an element Uj € A/[ — it*. Then: 


E„eV'. [/( T ^ ( u ) + u i) ~ f(T Bj (v) - Uj)} 

Uj € $ =► -i-|^-f- 

Zuev* t/( T ^ ( u ) + u i) ~ /( r C' ( u ) “ «i)] 


= m'j > 0 

= mj > 0 => Uj € Sj_i 


where the second derivation follows from submodularity since Vj is produced from Vj by extending 
vectors. □ 


To bound the loss in the competitive ratio of Algorithm [5] we need the following claims. 

Observation B.2. For every 1 < i < n, each set in V* is an independent sample of [0, l]-^. 

Proof. Follows immediately by the way Algorithm [5] constructs Vi and the way this collection is 
updated when new elements arrive. □ 


Lemma B.3. Let Xi,X 2 , ... ,X( be independent random variables such that for each i, Xi € [—1,1]. 
Let X = j i Xi an d Ai = E[X]. Then 


Pr[A > (jl + a] < 




for every a > 0. 

Proof. For every 1 < * < £, let Yi = (1 + Xf)/ 2. Additionally, let Y = ]Cj =1 Vj. Clearly, Y % G [0,1]. 
Hence, by the Chernoff bound: 


Pr[X > fi + a] = Pr[2 Y/l - 1 > 2E[V]/f - 1 + a] = Pr 


Y > E[y] + f 


< e 


K[Y](ul/(2-W.[Y})) 2 


= e 12 - E l v ] < e“ 
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and 


Pr[X <n-a\= Pr[2 Y/i - 1 < 2E [Y]/i - 1 - a] = Pr 


y < E[y] 


< e 


E[y](o;i/(2-E[y])) 2 


= e < e“ 


□ 


Let 777 ,( 2 ) be the value that rn t gets after the last iteration of Algorithm [5] if the random variable 
Oi is z. 

Lemma B.4. For every 1 < i < n and z € [0,1], Pr[|mj(z) — d Ui F(z • AT)] > e(4i) -2 • f(OPT )] < 
,:( 1 /) 2 . 

Proof. Observe that to,; is obtained by averaging |V,| independent samples of f(R(z -Af) + Ui ) — 
f(R(z-Af) — Ui), and the expectation of the last expression is d Ui F(z-Af). Additionally, observe that 
every sample belongs to the range [—f(OPT),f(OPT)\ since f(OPT) = ma x-scss f(S). Hence, by 
Lemma IB. 31 


-|V3P(4Q- 


Pr[|mi(» - d Ui F(z • AO I > e(4?:)' 2 ■ f(OPT)\ < 2e —— 


< 2e" ln[2£ 1(4i)2] = e(4 i) 


-2 


□ 


Fix the vectors V = (Vi, V 2 ,..., V n ). For every element iq € A f, mi{z ) is continuous non¬ 
increasing function of z by submodularity. Hence, by the intermediate value theorem, one of the 
following must hold: 771 ,( 2 ) is always positive in the range [0,1], nii(z) is always negative in the 
range [0,1] or 777 ,( 2 ) has at least one root zq € [0,1]. In the last case, the set Jo C [0,1] of the 
roots of m t (z) is non-empty. Moreover, by the monotonicity and continuity of rrijfz), Io is a closed 
range. Using these observations, we define a vector y(V) € [0, l]^" as follows: 


yiyy 


0 if 777,(0) < 0 , 

< 1 if 777,(1) > 0 , 

max Jq otherwise . 


Let £, be the event that d Ui F(z-Af) > — e(4z) 2 • f{OPT) for every 2 < y(V) Ui and d Ui F(z-Af) < 
e(4 i)~ 2 • f(OPT) for every 2 > y(V) Ui . 

Lemma B.5. For every 1 < i < n, the event Si holds with probability at least 1 — 2e(47)~ 2 . 

Proof. Let us first bound the probability that d Ui F(z-Af) > —e(41)^ 2 ■ f{OPT) for every 2 < y(V) Ui - 
If d Ui F(z - AT) is always at least —e(4 i)~' 2 ■ f(OPT), then there is nothing to prove. Otherwise, 
if d Ui F(z ■ Af) is always at most —e(4i) -2 • f(OPT ), then d Ui F(0) < 0, which guarantees that 
y(y) Ui = 0, and thus, makes the claim that we want to prove void. Hence, it is enough to consider 
the case in which d Ui F(z ■ Af) is at least —e(4?') -2 • f(OPT) for some value of 2 and at most 
—e(4?') -2 • f(OPT ) for another value of 2 . In this case the intermediate value theorem and the 
continuity of d Ui F(z-Af) imply the existence of a value y\ such that d Ui F(y^-Af ) = —e(4 i)~ 2 -f(OPT). 
By Lemma IB. 41 777 ^( 7 /)) < 0 with probability at least 1 — £(4?') -2 , which implies y(V) Ui < y[, and 
thus: 

d Ui F(y(y) Ui ■ Af) > d Ui F(y’-Af) = -e(4 i)~ 2 • f(OPT) . 

Observe that the last inequality implies that d Ui F(z-Af) > — e(4i)~ 2 • f(OPT) for every 2 < y(V) Ui . 

A similar argument shows that d Ui F(z ■ Af) < e(4i)^ 2 • f(OPT) for every 2 > y(V) Ui with 
probability at least 1 — £(4iy 2 . The lemma now follow by the union bound. □ 

Let S be the event that Si holds for every 1 < i < n at the same time. 
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Corollary B.6. £ happens with probability at least 1 — e/2. 
Proof. Combining Lemma IB. 5 1 and the union bound gives: 


Pr[f]>l-^2e(4i)- 2 >l-| 



□ 


It is important to notice that the event £ depends only on the vectors V. Thus, fixing the 
vectors V does not affect the distribution of 9\, 02, ■ ■ ■, 9 n . Next, we analyze the competitive ratio 
of Algorithm [5] assuming V is fixed in a way that makes the event £ hold. The next observation 
corresponds to Observation 13.21 

Observation B.7. For every given vectors V, every element Ui € J\T belongs to S n with probability 
y(V) Ui , independently. Hence, E[/(5' ri )] = F(y(V)). 

Proof. An element tq € A f belongs to S n if and only if ) > 0, which is equivalent to the 
condition 9i < y(V) Ui since rrii is a non-increasing function. Clearly, the last condition happens 
with probability y(V) Ui , and is independent for different elements. □ 

The last observation implies that analyzing Algorithm [5] is equivalent to lower bounding F(y(V)). 
The next lemma and corollary correspond to Lemma 13.31 and Corollary 13.51 respectively. 

Lemma B.8. For every given vectors V for which the event £ holds and for every A € [0,1], 
F(y(V) A (A • AO) > F(X ■ AT) — £ ■ £”=1 (4i)“ 2 • f(OPT). 

Proof. For ease of notation, we use y as a shorthand for y(V) in this proof. Observe that: 

F(y A (A • AO) = /(0) + [ X dF ^^j z -^ dz = /(0 ) + [ X V d u F(y A (z ■ M))dz , 

z<yu 


where the second equality is due to the chain rule. By submodularity and the observation that £ 
implies d Ui F(z ■ Af) < e(4 i)~ 2 ■ f(OPT) for every z > y Ui , we get: 


9 u F(y A (z ■ A/")) > 9 u F(z ■ M) > Y, duF ^ z Y ^~ 2 ' f( OPT ) 

u£j\f u£j\f u£j\f i= 1 

z<y u z<y u 


Combining the above equality and inequality, and using the chain rule again, gives: 

r X 


F{y A (A • A/")) > / (0) + 

= /( 0 ) + 


YduF(z-Af) 

' 0 UeAT 

f x dF(z ■ A0 

/ o 


dz 


dz — e ■ X^(40 _2 • f{OPT) 
1=1 
n 

dz — e ■ X^(4i)~ 2 ' f{OPT) 


1=1 


= F(A • J\f) — e ■ X^(4i) -2 • f(OPT) 


□ 


1=1 


Corollary B.9. For every given vectors V for which the event £ holds, F(y(V)) > (e 1 — e/2) • 
F(OPT). 
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Proof. Again, we use y as a shorthand for y(V) in this proof. Fix two values 0 < a < b < 1. By 
the chain rule: 


F(yA(b-Af))-F(yA(a-Af)) 


dF(y A (z ■ Af)) 
dz 


f 

J a 

f J2d u F(yA(z-Af))dz> f J2 d u F(z-Af)dz , 

J a r..r-\r J a n .r- \r 


(4) 


usa r 

z< y u 


uGJ\T 

z<y u 


where the inequality follows from submodularity. We use two ways to lower bound the rightmost 
hand side of the above inequality. First, observe that d Ui F(z - AT) > —e(4 i)~ 2 ■ f(OPT) whenever 
z < y Ui . Thus, F(y A ( b ■ Af)) — F(y A (a • Af)) > —e(b — a) ■ Li(4?’) -2 • f(OPT). Additionally, 

d u F(z ■ Af) < e(4 i)~ 2 ■ f(OPT) whenever z > y u and d u F(z ■ Af) > —£(4 i)~ 2 ■ f(OPT) whenever 
z < y u . This allows us to derive the second lower bound, which holds whenever z is not equal to 
any coordinate of y (i.e., for every value of z except for a finite set of values): 

n 

J2 d uF(z-Af)> J2 d u F(z-Af)-£-J2^i)~ 2 ■ f(OPT) 

uGAf uGOPT i= 1 

z<y u 

= ^[n»vM))-^^)] _ e .^ (4i) -2. f{OPT) 

1=1 

> F( 0 f TV(,W))-F(,W) _ £ , £ 2 . 

1 — z 

i =1 

> f(OPT) - F ^^-N)) _ £ .f 1 + _M . ^(4i)- 2 • f(OPT) , 

' ' 1=1 

where the last inequality follows from Lemmata IB.8I and 13.41 Plugging this lower bound and a = 0 
into (jU) gives: 


F(yA(b-Af)) 

> m + f 

Jo 


f(OPT) - 


F(y A (z - Af)) 


1-z 

The solution of this differential equation is: 


— £ • I 1 + 


1-6 


J^(4i)- 2 ■ f(OPT) 


1=1 


dz 


F{y A (6 • Af)) > -(1 - 6) • ln(l - 6) ■ (l - e ■ (l + ' /( OPT ) • 


Choosing 6 = 1 — e \ we get: 


F(y) = F(y A (1 • Af)) > F(y A ((1 - e" 1 ) • Af)) - £e" 1 • ^(4i)- 2 ■ f(OPT) 

1=1 


> e_1 yi-4e- E( 4i )' 2 j ' /(° PT ) 

> (e- 1 -4£-^(4f)- 2 ] -f(OPT) 


— ee 


-l 


i =1 


E(4^)' 2 • f (° PT ) 

2—1 
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The corollary now follows by observing that: 


4 ■ ^ (4 i 


r 2 > i + 4 


i=1 


s: 


dx 

(4x) : 


1 

4x 


1 1 1 

< 4 + 4 _ 2 


□ 


We are now ready to prove the competitive ratio of Algorithm [5l and complete the proof of 
Theorem 11.21 

Lemma B.10. Algorithm 0 has a competitive ratio of at least e -1 — e. 

Proof. By Observation IB.71 the expected value of the set produced by Algorithm [5] is E[F(y(V))]. 
By the law of total expectation: 


E[F(y(V))] > Pr[£] • E[F(y(V)) | S] > (1 - e/2) • (e" 1 - e/2) • f(OPT) > (e" 1 * - e) ■ f(OPT) , 
where the second inequality holds by Corollaries IB.61 and IB.91 □ 


C A Polynomial Time Implementation of Algorithm [3] for the Ob¬ 
jective g(S) 

In this section we present a polynomial time implementation of Algorithm [3] that can be used to 
get a set S obeying g(S) > (56/627 — e) • f(OPT), for any constant e > 0, and thus, together with 
the rounding described in Section 21 proves the second part of Theorem 11.51 The polynomial time 
implementation is given as Algorithm [6l 


Algorithm 6: Greedy with Theshold for Non-monotone Functions - Polynomial Time Implemen¬ 
tation^,p) 


1 Let So t— 0. 

2 foreach element Ui revealed do 

3 if i < pk then 

4 I— Let Si < — Si-i + Ui. 


5 

6 
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else 

for each u € S,_i do 
Approximate 


8 

9 

10 

11 

12 


m u>i = g(Si -1 + Ui - u'f) - (1 -Pc/ (pk)) ■ g(Si- 1 ) 

= lE[/(R(p _1 (Sj_i + Ui - u'i))) - (1 + c/(pk)) ■ /(R(p _1 • Si-t))] 

using £ = |"50000e -2 z 6 (p + c) 2 \n[2e^ 1 pk(2i) 2 \\ samples. 

Let vf be the element of S,;_i maximizing m u > j. 
if m u > i > 0 then 
|_ Let Si «- Si-i + Ui - u[. 

else 

| Let Si 4— Si-i . 
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Lemma C.l. Every value m U: i calculated by Algorithm 0 obeys: 


Pr[\m U}i -g(S i _i+u i -u , i ) + (l + c/(pk))-g(S i _i)\<£(Ai) 3 • f(OPT)} < ep 1 k 1 (2i) 2 . 

Proof. The value m Ut i is calculated by averaging t samples. For every set S of size at most pk, 
submodularity implies f(S ) < p ■ f(OPT). Hence, each sample must belong to the range [—(p + 
c/k) ■ f(OPT),p ■ f(OPT)}. Thus, by Lemma, I tOI 

Pr[| m U)i - g(Si_i +Ui~ uf) + (1 + c/(pk)) • g(Si-i)\ < e(4*) -3 • f(OPT )] < 2e ^ 

< 2e _ln[2£_lpfc ( 2 *) 2 ] = ep~ 1 k~ l (2i)~ 2 . □ 


Corollary C.2. With probability at least 1 — e/2, all the values m u i calculated by Algorithmic are 
accurate approximations up to an error o/e(4i)~ 3 • f(OPT). 

Proof. For every 1 < i < n, Algorithm [6] approximates at most pk different m Uy i values. Hence, 
by the union bound, the probability that all these approximations are correct up to an error of 
e(4i)~ 3 ■ f(OPT ) is at least: 


n 


1 — pk ■ (ep 

i= 1 


k 1 (2i) 2 ) = 1 - e-^(2F 


-2 


i =1 


- 1 4 6 


dx _ e e 

— = 1 — 7 + ~ A — 
4x 4 4x 


n > 1 - - 
1 2 


□ 


Let £ be the event that Corollary 1C.21 holds. As in Section [4j for every 0 < i < n we define 
Ai = U* = 1 Si/. The following two lemmata correspond to Lemmata 14.41 and 14.51 

Lemma C.3. Assuming £ holds, then for every pk < i < n, g(Si -1 + Ui — u'f) — g(Si- 1) > g(ui \ 
Ai-\) — g(Si-\)/(pk) — 2e(4i)~ 3 ■ f (OPT). 

Proof. By the choice of u( and an averaging argument, 


g(Si- 1 +Ui- u’i) - g(Si-i) > + c/ (pk) ■ g(S i - 1 ) - e(4i) 3 • f(OPT) 

E M 'g Si-! + °/{P k ) • 9(Si-i) - e(4i)~ 3 • f(OPT)} 

pk 

[9(Si-i +Uj- u') - g(5 i _ 1 ) - 2e(4i)~ 3 • f(OPT )] 
pk 


> 


> 


> E u ' eSi .M S ^ +u i)-9iSi-i)] + Eu'eS..MS l . 1 -n')-g(S^ 1 )} _ 2£ ^- 3 . /(OPT) 


pk 


pk 


where the last inequality follows from submodularity. The first term on the rightmost hand side is 
g(ui | Si- 1 ), which can be lower bounded by g(m \ 1 ) because S',;_ 1 C 1 . The second term 

on the rightmost hand side can be lower bounded using the submodularity and non-negativity of g 
as follows: 

Su'es,-! [9(Si-1 ~ U 1 ) - g(Sj- 1 )] > g(0) - g(S i _ 1 ) > g(Sj-i) Q 

pk — pk ~ pk 


The following lemma relates Si and A,. 


Lemma C.4. Assuming £ holds, then for every 0 < i < n, g(Si) > • g(Af) — 2e ■ i(4j) 3 

f(OPT) > g(Ai) - e/4 • f(OPT). 
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Proof. The second inequality holds since (4j) 3 = 0 for z = 0 and for i > 0: 

„ ,, .. _o 1 P dx 11 

j) <^ + j 32x3 = 32 “ 64^2 
i =i 1 

Hence, in the rest of the proof we concentrate on proving the first inequality. 

For z = 0, the inequality clearly holds since g(Ap) = g(0 ) = g(So). Thus, it is enough to prove 
that for every 1 < i < n, g(Sf) - g(Si- 1 ) > ^ • [ g(Ai ) - s/Hj_i)] - 2e(4z ) -3 • /(OPT). If the 
algorithm does not accept Ui then Si = Si-± and Aj = Aj-\, and the claim is trivial. Hence, we 
only need to consider the case in which the algorithm accepts element zq. 

If i < k then g(Si) - g(Si-i) = g{ui \ Si- 1 ) > g{ui | Hj_i) = g{Af) - g{Ai- 1 ), where the 
inequality follows from submodularity since Si-\ C i. If i > k, then it is possible to lower 
bound g(Si) — g(Si- 1 ) = g(Si -1 + Ui — u'f) — g(Si- 1 ) in two ways. First, a lower bound of g(ui \ 
Ai- 1 ) — g(Si-\)/k — 2e(4 i)~ 3 ■ /(OPT) is given by Lemma I4~T1 The second lower bound states that, 
since the algorithm accepts Ui, 

g{Si- i + Ui - u'i) - g(Si- 1 ) > m u ’ v i + c ■ g{S i - 1 )/ ( pk ) - 6(4z )~ 3 • /(OPT) 

> c • ^^/(pfc) - e(4i )- 3 • /(OPT) . 

Using both lower bounds, we get: 

g(Si) - Sf(S;-i) > max / g(ui \ A t - 1 ) - _ 26(4z )~ 3 • f(OPT ) 

( pfc pk J 

> C • I Aj-i) - g(5j_i)/(pfc)] + 1 • [c • g(5,-i)/(pA;)] _ ^ 4^-3 . f(OPT) 

c + 1 

> • < 7 ( 1 * I A-i) - 2e(4i )- 3 • /(OPT) 
c+ 1 

= -4t • b(A) - 5 (A-i)] - 26(4z )" 3 • /(OPT) . □ 

c+ 1 

We also need to show that s/S)) is roughly monotone as a function of i. 

Lemma C.5. Assuming £ holds, then for pk < i < n, g(Si) — g(Si- 1 ) > — e(4z )~ 3 • /(OPT). 
Proof. If Si = Si -1 then the claim is trivial. Otherwise, we must have rn u >. t > 0, and thus, 

g(Si) - giSi-i) > C -^M. _ e(4z ) -3 • /(OPT) > -e(4z )- 3 • /(OPT) . □ 

pk 

Corollary C.6. Assuming £ holds, then for pk < i < n, g(Si ) > g(S n ) — 6 ( 8 z ) 2 • /(OPT). 

Proof. By Lemma 1C. 51 


1 1 1 
< 32 + 64 < 4 ' 


n f n dr 

g(Sn) - g(Si ) > - e ■ £ ( 4i )' 3 • /(° PT ) > - £ • J. 64^3 • f(OPT ) 


j=i +1 

1 


= 6 


128x 2 


/(OPT) > - 


128z 2 


/(OPT) > - 


(8z)^ 


/(OPT) . 


Using the above claims, it is possible to get a guarantee on g(S n ) assuming £ holds, 
following corollary corresponds to Corollary 14.71 


□ 

The 
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Corollary C.7. Assuming £ holds, Algorithm^ produces a set S n such that: 

«U2(rrwnQ-t)-'< opr >- 

\{c + 1)(1 +p 1 c) 2 J 

Proof. Consider an element Ui £ OPT \ Aj. Since U{ was rejected by Algorithm [6l we must have 
rn u i j < 0, which implies: 

c- g{Si- 1 ) 


pk 


+ e(4 i) 3 • f{OPT) > g(Si-i + Ui - u\) - g(Si- 1 ) 


> g( Ui | Aj-i) - - 2e(4i)- 3 • /(OPT) , 


where the second inequality follows from Lemma 1C.31 Rearranging yields: 

9 (ui | ^ ^ • g(Si-!) + 3e(4 i)~ 3 ■ f(OPT) < • g(S n ) + 4e(8 i)~ 2 • f(OPT) , 

where the last inequality uses Corollary IC.61 By the submodularity of g and Lemma 1C.41 we now 
get: 

g(A n U OPT ) < g(A n ) + ^ g(ui \ A n ) 

ueOPT\A n 

c + 1 


< g(A n ) + 

ueOPT\A n 


pk 


\ u 

g(S n )) +4e-^(8 i)~ 2 -f(OPT) 

' i =1 


< g(A n )+p~\c + 1) • g(S n ) + 4e • ^(8i)" 2 • f(OPT) 

2—1 

C + 1 6 


/ (c+ 1)(1 +p : c) , , 

<- g{S n ) + 


Observe now that: 

n 


/ T 

■i=i 


c 4 




f(OPT) + 4e • 5> i)~ 2 -f(OPT) . 


1 


16x 2 16 16s 


2—1 


£ 1 £ 

<- 1 -< - , 

“16 16 4 ’ 


and, by Lemma 14.61 

g(A n U OPT) = F{p~ l ■ OPT + p~ l • (A n \ OPT)) > p~\ 1 - p~ l ) ■ f(OPT) 
Combining the three last inequalities yields: 


— l/i — 1 ' c+1 £ 


p-\l-p-^)- 


c 2 


•/(OPT)< (c+1)(1+ ^ lc) - ff (5„ 


The corollary now follow by dividing the last inequality by C+BR+p —— and observing that 1 
p~ 1 c > 1. 

Corollary C.8. Algorithm^ produces a set S n such that: 

Ws.)]z( Xw + rl) -‘)- flOFT) ■ 

Hence, for c = 7/4 and p = 3, g(S n ) > (56/627 — e) • f(OPT). 
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Proof. By the law of total expectation, 


na(S n )} > Pr[£] • E[g(S n ) I E] > (1 - e/2) 
cp~ 1 ( 1 — p 1 ) 


cp X (1 — p 


> 


— e 


(c+ 1)(1 +p x c) 
f(OPT) , 


(c + 1)(1 +p _1 c) 
where the second inequality holds by Corollaries 1C.21 and 1C. 71 


) • fiOPT) 

□ 
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